Probability-Enhanced Predictions in the Anticipatory Classifier System
نویسندگان
چکیده
In contrast to common Learning Classiier Systems (LCSs), classiiers in the Anticipatory Classiier System (ACS) have a condition-action-anticipation-payoo structure (Stolzmann, 1998). The learning is based on the accuracy of predicted environmental eeects (i.e. anticipations) rather than on the payoo predictions, as in traditional LCSs, or the accuracy of payoo predictions, as in XCS (Wilson, 1995). Anticipation based learning enables the ACS to learn latently (i.e. learning without getting any reward) a complete internal model of the environment rather than a condition-action-payoo model. Additionally to the evolution of an internal environmental model, the ACS applies reinforcement learning resulting in the formation of a behavioral policy. The evolving environmental model enables the ACS to use more sophisticated processes (i.e. cognitive processes) to solve more diicult tasks, speed up learning, and reene its behavioral policy. Stolzmann, Butz, Hoomann, and Goldberg (2000) investigate the present cognitive capabilities. Figure 1 visualizes the learning in the ACS. Fig. 1. A behavioral act in the ACS with all the involved learning procedures Similar to all LCSs, the ACS evolves a population of classiiers. The main learning mechanism is the Anticipatory Learning Process (ALP) which is derived from a cognitive learning theory. It generates oospring classiiers by specializing inaccurate, more general ones. The recently introduced genetic generalization (Butz, Gold-berg, & Stolzmann, 2000) evolves accurate, maximally general classiiers out of the partly over-specialized classiiers created by the ALP. Additionally , the reinforcement learning component, which is a combination of the Bucket-Brigade and Q-learning, evolves a behavioral policy in the ACS. In order to be able to learn an internal environmental model, the ACS relies on a causality in successive perceptions of an environment. Any non-determinism challenges the learning mechanism since until now the ACS considered all changes in the environment to be caused by itself. Herein, we consequently investigate the possibilities in the ACS to enhance the predictions. We enhance the eeect part of a classiier to probability-enhanced eeects (PEEs)
منابع مشابه
Probability-enhanced Predictions in the Anticipatory Classiier System Probability-enhanced Predictions in the Anticipatory Classiier System
The Anticipatory Classiier System (ACS) recently showed many capabilities new to the Learning Classiier System eld. Due to its enhanced rule structure with an eeect part, it forms an internal environmental representation, learns latently besides the common reward learning, and can use many cognitive processes. This paper introduces a probability-enhancement in the predictions of the ACS which e...
متن کاملVoltage Sag Compensation with DVR in Power Distribution System Based on Improved Cuckoo Search Tree-Fuzzy Rule Based Classifier Algorithm
A new technique presents to improve the performance of dynamic voltage restorer (DVR) for voltage sag mitigation. This control scheme is based on cuckoo search algorithm with tree fuzzy rule based classifier (CSA-TFRC). CSA is used for optimizing the output of TFRC so the classification output of the network is enhanced. While, the combination of cuckoo search algorithm, fuzzy and decision tree...
متن کاملBiasing Exploration in an Anticipatory Learning Classifier System
The chapter investigates how model and behavioral learning can be improved in an anticipatory learning classifier system by biasing exploration. First, the applied system ACS2 is explained. Next, an overview over the possibilities of applying exploration biases in an anticipatory learning classifier system and specifically ACS2 is provided. In ACS2, a recency bias termed action delay bias as we...
متن کاملAnticipations Control Behavior: Animal Behavior in an Anticipatory Learning Classifier System
The concept of anticipations controlling behavior is introduced. Background is provided about the importance of anticipations from a psychological perspective. Based on the psychological background wrapped in a framework of anticipatory behavioral control, the anticipatory learning classifier system ACS2 is explained. ACS2 learns and generalizes on-line a predictive environmental model (a model...
متن کاملEnhanced Predictions of Tides and Surges through Data Assimilation (TECHNICAL NOTE)
The regional waters in Singapore Strait are characterized by complex hydrodynamic phenomena as a result of the combined effect of three large water bodies viz. the South China Sea, the Andaman Sea, and the Java Sea. This leads to anomalies in water levels and generates residual currents. Numerical hydrodynamic models are generally used for predicting water levels in the ocean and seas. But thei...
متن کامل